Exploration univariée de chaque variable du jeu de données. Exploration par rapport à la cible. Nombre de valeur manquante.
Il y a 79 variables explicatives.
Il y a 23 variables nominale.
Il y a 23 variables ordinale.
Il y a 14 variables discrète.
Il y a 20 variables continue.
(j’ai essayé de traduire le libellé des variables rapidement …)
train[, summary(SalePrice)]
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 34900 130000 163000 180900 214000 755000
train[, summary(logSalePrice)]
## Min. 1st Qu. Median Mean 3rd Qu. Max.
## 10.46 11.78 12.00 12.02 12.27 13.53
Parfois certaines variables ont des NA’s “logique”, à voir s’il y a des valeurs manquantes “missing at random”. Supression des variables dans ce tableau n’ayant aucune valeurs manquantes dans les deux sets.
| set | variable | NA’s |
|---|---|---|
| train | PoolQC | 1453 |
| test | PoolQC | 1456 |
| train | MiscFeature | 1406 |
| test | MiscFeature | 1408 |
| train | Alley | 1369 |
| test | Alley | 1352 |
| train | Fence | 1179 |
| test | Fence | 1169 |
| train | SalePrice | 0 |
| test | SalePrice | 1459 |
| train | logSalePrice | 0 |
| test | logSalePrice | 1459 |
| train | FireplaceQu | 690 |
| test | FireplaceQu | 730 |
| train | LotFrontage | 259 |
| test | LotFrontage | 227 |
| train | GarageYrBlt | 81 |
| test | GarageYrBlt | 78 |
| train | GarageFinish | 81 |
| test | GarageFinish | 78 |
| train | GarageQual | 81 |
| test | GarageQual | 78 |
| train | GarageCond | 81 |
| test | GarageCond | 78 |
| train | GarageType | 81 |
| test | GarageType | 76 |
| train | BsmtCond | 37 |
| test | BsmtCond | 45 |
| train | BsmtExposure | 38 |
| test | BsmtExposure | 44 |
| train | BsmtQual | 37 |
| test | BsmtQual | 44 |
| train | BsmtFinType2 | 38 |
| test | BsmtFinType2 | 42 |
| train | BsmtFinType1 | 37 |
| test | BsmtFinType1 | 42 |
| train | MasVnrType | 8 |
| test | MasVnrType | 16 |
| train | MasVnrArea | 8 |
| test | MasVnrArea | 15 |
| train | MSZoning | 0 |
| test | MSZoning | 4 |
| train | Utilities | 0 |
| test | Utilities | 2 |
| train | BsmtFullBath | 0 |
| test | BsmtFullBath | 2 |
| train | BsmtHalfBath | 0 |
| test | BsmtHalfBath | 2 |
| train | Functional | 0 |
| test | Functional | 2 |
| train | Exterior1st | 0 |
| test | Exterior1st | 1 |
| train | Exterior2nd | 0 |
| test | Exterior2nd | 1 |
| train | BsmtFinSF1 | 0 |
| test | BsmtFinSF1 | 1 |
| train | BsmtFinSF2 | 0 |
| test | BsmtFinSF2 | 1 |
| train | BsmtUnfSF | 0 |
| test | BsmtUnfSF | 1 |
| train | TotalBsmtSF | 0 |
| test | TotalBsmtSF | 1 |
| train | Electrical | 1 |
| test | Electrical | 0 |
| train | KitchenQual | 0 |
| test | KitchenQual | 1 |
| train | GarageCars | 0 |
| test | GarageCars | 1 |
| train | GarageArea | 0 |
| test | GarageArea | 1 |
| train | SaleType | 0 |
| test | SaleType | 1 |
| MSSubClass | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| 120 | 1-STORY PUD (Planned Unit Development) - 1946 & NEWER | 87 | 95 | 5.96 | 6.51 |
| 150 | 1-1/2 STORY PUD - ALL AGES | 0 | 1 | 0.00 | 0.07 |
| 160 | 2-STORY PUD - 1946 & NEWER | 63 | 65 | 4.32 | 4.46 |
| 180 | PUD - MULTILEVEL - INCL SPLIT LEV/FOYER | 10 | 7 | 0.68 | 0.48 |
| 190 | 2 FAMILY CONVERSION - ALL STYLES AND AGES | 30 | 31 | 2.05 | 2.12 |
| 20 | 1-STORY 1946 & NEWER ALL STYLES | 536 | 543 | 36.71 | 37.22 |
| 30 | 1-STORY 1945 & OLDER | 69 | 70 | 4.73 | 4.80 |
| 40 | 1-STORY W/FINISHED ATTIC ALL AGES | 4 | 2 | 0.27 | 0.14 |
| 45 | 1-1/2 STORY - UNFINISHED ALL AGES | 12 | 6 | 0.82 | 0.41 |
| 50 | 1-1/2 STORY FINISHED ALL AGES | 144 | 143 | 9.86 | 9.80 |
| 60 | 2-STORY 1946 & NEWER | 299 | 276 | 20.48 | 18.92 |
| 70 | 2-STORY 1945 & OLDER | 60 | 68 | 4.11 | 4.66 |
| 75 | 2-1/2 STORY ALL AGES | 16 | 7 | 1.10 | 0.48 |
| 80 | SPLIT OR MULTI-LEVEL | 58 | 60 | 3.97 | 4.11 |
| 85 | SPLIT FOYER | 20 | 28 | 1.37 | 1.92 |
| 90 | DUPLEX - ALL STYLES AND AGES | 52 | 57 | 3.56 | 3.91 |
| Total | Total | 1460 | 1459 | 99.99 | 100.01 |
| MSZoning | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 0 | 4 | 0.00 | 0.27 |
| C (all) | Commercial | 10 | 15 | 0.68 | 1.03 |
| FV | Floating Village Residential | 65 | 74 | 4.45 | 5.07 |
| RH | Residential High Density | 16 | 10 | 1.10 | 0.69 |
| RL | Residential Low Density | 1151 | 1114 | 78.84 | 76.35 |
| RM | Residential Medium Density | 218 | 242 | 14.93 | 16.59 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 21 | 59 | 69 | 80 | 313 | 259 |
| test | 21 | 58 | 67 | 80 | 200 | 227 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 1300 | 7553.5 | 9478.5 | 11601.5 | 215245 | 0 |
| test | 1470 | 7391.0 | 9399.0 | 11517.5 | 56600 | 0 |
## `geom_smooth()` using method = 'gam'
| Street | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Grvl | Gravel | 6 | 6 | 0.41 | 0.41 |
| Pave | Paved | 1454 | 1453 | 99.59 | 99.59 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| Alley | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No alley access | 1369 | 1352 | 93.77 | 92.67 |
| Grvl | Gravel | 50 | 70 | 3.42 | 4.80 |
| Pave | Paved | 41 | 37 | 2.81 | 2.54 |
| Total | Total | 1460 | 1459 | 100.00 | 100.01 |
| LotShape | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| IR1 | Slightly irregular | 484 | 484 | 33.15 | 33.17 |
| IR2 | Moderately Irregular | 41 | 35 | 2.81 | 2.40 |
| IR3 | Irregular | 10 | 6 | 0.68 | 0.41 |
| Reg | Regular | 925 | 934 | 63.36 | 64.02 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| LandContour | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Bnk | Banked - Quick and significant rise from street grade to building | 63 | 54 | 4.32 | 3.70 |
| HLS | Hillside - Significant slope from side to side | 50 | 70 | 3.42 | 4.80 |
| Low | Depression | 36 | 24 | 2.47 | 1.64 |
| Lvl | Near Flat/Level | 1311 | 1311 | 89.79 | 89.86 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| Utilities | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 0 | 2 | 0.00 | 0.14 |
| AllPub | All public Utilities (E | 1459 | 1457 | 99.93 | 99.86 |
| NoSeWa | Electricity and Gas Only | 1 | 0 | 0.07 | 0.00 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| LotConfig | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Corner | Corner lot | 263 | 248 | 18.01 | 17.00 |
| CulDSac | Cul-de-sac | 94 | 82 | 6.44 | 5.62 |
| FR2 | Frontage on 2 sides of property | 47 | 38 | 3.22 | 2.60 |
| FR3 | Frontage on 3 sides of property | 4 | 10 | 0.27 | 0.69 |
| Inside | Inside lot | 1052 | 1081 | 72.05 | 74.09 |
| Total | Total | 1460 | 1459 | 99.99 | 100.00 |
| LandSlope | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Gtl | Gentle slope | 1382 | 1396 | 94.66 | 95.68 |
| Mod | Moderate slope | 65 | 60 | 4.45 | 4.11 |
| Sev | Severe slope | 13 | 3 | 0.89 | 0.21 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| Neighborhood | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Blmngtn | Bloomington Heights | 17 | 11 | 1.16 | 0.75 |
| Blueste | Bluestem | 2 | 8 | 0.14 | 0.55 |
| BrDale | Briardale | 16 | 14 | 1.10 | 0.96 |
| BrkSide | Brookside | 58 | 50 | 3.97 | 3.43 |
| ClearCr | Clear Creek | 28 | 16 | 1.92 | 1.10 |
| CollgCr | College Creek | 150 | 117 | 10.27 | 8.02 |
| Crawfor | Crawford | 51 | 52 | 3.49 | 3.56 |
| Edwards | Edwards | 100 | 94 | 6.85 | 6.44 |
| Gilbert | Gilbert | 79 | 86 | 5.41 | 5.89 |
| IDOTRR | Iowa DOT and Rail Road | 37 | 56 | 2.53 | 3.84 |
| MeadowV | Meadow Village | 17 | 20 | 1.16 | 1.37 |
| Mitchel | Mitchell | 49 | 65 | 3.36 | 4.46 |
| NAmes | North Ames | 225 | 218 | 15.41 | 14.94 |
| NPkVill | Northpark Villa | 9 | 14 | 0.62 | 0.96 |
| NWAmes | Northwest Ames | 73 | 58 | 5.00 | 3.98 |
| NoRidge | Northridge | 41 | 30 | 2.81 | 2.06 |
| NridgHt | Northridge Heights | 77 | 89 | 5.27 | 6.10 |
| OldTown | Old Town | 113 | 126 | 7.74 | 8.64 |
| SWISU | South & West of Iowa State University | 25 | 23 | 1.71 | 1.58 |
| Sawyer | Sawyer | 74 | 77 | 5.07 | 5.28 |
| SawyerW | Sawyer West | 59 | 66 | 4.04 | 4.52 |
| Somerst | Somerset | 86 | 96 | 5.89 | 6.58 |
| StoneBr | Stone Brook | 25 | 26 | 1.71 | 1.78 |
| Timber | Timberland | 38 | 34 | 2.60 | 2.33 |
| Veenker | Veenker | 11 | 13 | 0.75 | 0.89 |
| Total | Total | 1460 | 1459 | 99.98 | 100.01 |
| Condition1 | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Artery | Adjacent to arterial street | 48 | 44 | 3.29 | 3.02 |
| Feedr | Adjacent to feeder street | 81 | 83 | 5.55 | 5.69 |
| Norm | Normal | 1260 | 1251 | 86.30 | 85.74 |
| PosA | Adjacent to postive off-site feature | 8 | 12 | 0.55 | 0.82 |
| PosN | Near positive off-site feature–park | 19 | 20 | 1.30 | 1.37 |
| RRAe | Adjacent to East-West Railroad | 11 | 17 | 0.75 | 1.17 |
| RRAn | Adjacent to North-South Railroad | 26 | 24 | 1.78 | 1.64 |
| RRNe | Within 200’ of East-West Railroad | 2 | 4 | 0.14 | 0.27 |
| RRNn | Within 200’ of North-South Railroad | 5 | 4 | 0.34 | 0.27 |
| Total | Total | 1460 | 1459 | 100.00 | 99.99 |
| Condition2 | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Artery | Adjacent to arterial street | 2 | 3 | 0.14 | 0.21 |
| Feedr | Adjacent to feeder street | 6 | 7 | 0.41 | 0.48 |
| Norm | Normal | 1445 | 1444 | 98.97 | 98.97 |
| PosA | Adjacent to postive off-site feature | 1 | 3 | 0.07 | 0.21 |
| PosN | Near positive off-site feature–park | 2 | 2 | 0.14 | 0.14 |
| RRAe | Adjacent to East-West Railroad | 1 | 0 | 0.07 | 0.00 |
| RRAn | Adjacent to North-South Railroad | 1 | 0 | 0.07 | 0.00 |
| RRNn | Within 200’ of North-South Railroad | 2 | 0 | 0.14 | 0.00 |
| Total | Total | 1460 | 1459 | 100.01 | 100.01 |
| BldgType | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| 1Fam | Single-family Detached | 1220 | 1205 | 83.56 | 82.59 |
| 2fmCon | Two-family Conversion; originally built as one-family dwelling | 31 | 31 | 2.12 | 2.12 |
| Duplex | Duplex | 52 | 57 | 3.56 | 3.91 |
| Twnhs | Townhouse Inside Unit | 43 | 53 | 2.95 | 3.63 |
| TwnhsE | Townhouse End Unit | 114 | 113 | 7.81 | 7.75 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| HouseStyle | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| 1.5Fin | One and one-half story: 2nd level finished | 154 | 160 | 10.55 | 10.97 |
| 1.5Unf | One and one-half story: 2nd level unfinished | 14 | 5 | 0.96 | 0.34 |
| 1Story | One story | 726 | 745 | 49.73 | 51.06 |
| 2.5Fin | Two and one-half story: 2nd level finished | 8 | 0 | 0.55 | 0.00 |
| 2.5Unf | Two and one-half story: 2nd level unfinished | 11 | 13 | 0.75 | 0.89 |
| 2Story | Two story | 445 | 427 | 30.48 | 29.27 |
| SFoyer | Split Foyer | 37 | 46 | 2.53 | 3.15 |
| SLvl | Split Level | 65 | 63 | 4.45 | 4.32 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| OverallQual | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| 1 | Very Poor | 2 | 2 | 0.14 | 0.14 |
| 10 | Very Excellent | 18 | 13 | 1.23 | 0.89 |
| 2 | Poor | 3 | 10 | 0.21 | 0.69 |
| 3 | Fair | 20 | 20 | 1.37 | 1.37 |
| 4 | Below Average | 116 | 110 | 7.95 | 7.54 |
| 5 | Average | 397 | 428 | 27.19 | 29.34 |
| 6 | Above Average | 374 | 357 | 25.62 | 24.47 |
| 7 | Good | 319 | 281 | 21.85 | 19.26 |
| 8 | Very Good | 168 | 174 | 11.51 | 11.93 |
| 9 | Excellent | 43 | 64 | 2.95 | 4.39 |
| Total | Total | 1460 | 1459 | 100.02 | 100.02 |
| OverallCond | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| 1 | Very Poor | 1 | 6 | 0.07 | 0.41 |
| 2 | Poor | 5 | 5 | 0.34 | 0.34 |
| 3 | Fair | 25 | 25 | 1.71 | 1.71 |
| 4 | Below Average | 57 | 44 | 3.90 | 3.02 |
| 5 | Average | 821 | 824 | 56.23 | 56.48 |
| 6 | Above Average | 252 | 279 | 17.26 | 19.12 |
| 7 | Good | 205 | 185 | 14.04 | 12.68 |
| 8 | Very Good | 72 | 72 | 4.93 | 4.93 |
| 9 | Excellent | 22 | 19 | 1.51 | 1.30 |
| Total | Total | 1460 | 1459 | 99.99 | 99.99 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 1872 | 1954 | 1973 | 2000 | 2010 | 0 |
| test | 1879 | 1953 | 1973 | 2001 | 2010 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 1950 | 1967 | 1994 | 2004 | 2010 | 0 |
| test | 1950 | 1963 | 1992 | 2004 | 2010 | 0 |
## `geom_smooth()` using method = 'gam'
| RoofStyle | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Flat | Flat | 13 | 7 | 0.89 | 0.48 |
| Gable | Gable | 1141 | 1169 | 78.15 | 80.12 |
| Gambrel | Gabrel (Barn) | 11 | 11 | 0.75 | 0.75 |
| Hip | Hip | 286 | 265 | 19.59 | 18.16 |
| Mansard | Mansard | 7 | 4 | 0.48 | 0.27 |
| Shed | Shed | 2 | 3 | 0.14 | 0.21 |
| Total | Total | 1460 | 1459 | 100.00 | 99.99 |
| RoofMatl | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| ClyTile | Clay or Tile | 1 | 0 | 0.07 | 0.00 |
| CompShg | Standard (Composite) Shingle | 1434 | 1442 | 98.22 | 98.83 |
| Membran | Membrane | 1 | 0 | 0.07 | 0.00 |
| Metal | Metal | 1 | 0 | 0.07 | 0.00 |
| Roll | Roll | 1 | 0 | 0.07 | 0.00 |
| Tar&Grv | Gravel & Tar | 11 | 12 | 0.75 | 0.82 |
| WdShake | Wood Shakes | 5 | 4 | 0.34 | 0.27 |
| WdShngl | Wood Shingles | 6 | 1 | 0.41 | 0.07 |
| Total | Total | 1460 | 1459 | 100.00 | 99.99 |
| Exterior1st | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 0 | 1 | 0.00 | 0.07 |
| AsbShng | Asbestos Shingles | 20 | 24 | 1.37 | 1.64 |
| AsphShn | Asphalt Shingles | 1 | 1 | 0.07 | 0.07 |
| BrkComm | Brick Common | 2 | 4 | 0.14 | 0.27 |
| BrkFace | Brick Face | 50 | 37 | 3.42 | 2.54 |
| CBlock | Cinder Block | 1 | 1 | 0.07 | 0.07 |
| CemntBd | Cement Board | 61 | 65 | 4.18 | 4.46 |
| HdBoard | Hard Board | 222 | 220 | 15.21 | 15.08 |
| ImStucc | Imitation Stucco | 1 | 0 | 0.07 | 0.00 |
| MetalSd | Metal Siding | 220 | 230 | 15.07 | 15.76 |
| Plywood | Plywood | 108 | 113 | 7.40 | 7.75 |
| Stone | Stone | 2 | 0 | 0.14 | 0.00 |
| Stucco | Stucco | 25 | 18 | 1.71 | 1.23 |
| VinylSd | Vinyl Siding | 515 | 510 | 35.27 | 34.96 |
| Wd Sdng | Wood Siding | 206 | 205 | 14.11 | 14.05 |
| WdShing | Wood Shingles | 26 | 30 | 1.78 | 2.06 |
| Total | Total | 1460 | 1459 | 100.01 | 100.01 |
| Exterior2nd | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 0 | 1 | 0.00 | 0.07 |
| AsbShng | Asbestos Shingles | 20 | 18 | 1.37 | 1.23 |
| AsphShn | Asphalt Shingles | 3 | 1 | 0.21 | 0.07 |
| Brk Cmn | Brick Common | 7 | 15 | 0.48 | 1.03 |
| BrkFace | Brick Face | 25 | 22 | 1.71 | 1.51 |
| CBlock | Cinder Block | 1 | 2 | 0.07 | 0.14 |
| CmentBd | Cement Board | 60 | 66 | 4.11 | 4.52 |
| HdBoard | Hard Board | 207 | 199 | 14.18 | 13.64 |
| ImStucc | Imitation Stucco | 10 | 5 | 0.68 | 0.34 |
| MetalSd | Metal Siding | 214 | 233 | 14.66 | 15.97 |
| Other | Other | 1 | 0 | 0.07 | 0.00 |
| Plywood | Plywood | 142 | 128 | 9.73 | 8.77 |
| Stone | Stone | 5 | 1 | 0.34 | 0.07 |
| Stucco | Stucco | 26 | 21 | 1.78 | 1.44 |
| VinylSd | Vinyl Siding | 504 | 510 | 34.52 | 34.96 |
| Wd Sdng | Wood Siding | 197 | 194 | 13.49 | 13.30 |
| Wd Shng | Wood Shingles | 38 | 43 | 2.60 | 2.95 |
| Total | Total | 1460 | 1459 | 100.00 | 100.01 |
| MasVnrType | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 8 | 16 | 0.55 | 1.10 |
| BrkCmn | Brick Common | 15 | 10 | 1.03 | 0.69 |
| BrkFace | Brick Face | 445 | 434 | 30.48 | 29.75 |
| None | None | 864 | 878 | 59.18 | 60.18 |
| Stone | Stone | 128 | 121 | 8.77 | 8.29 |
| Total | Total | 1460 | 1459 | 100.01 | 100.01 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 166 | 1600 | 8 |
| test | 0 | 0 | 0 | 164 | 1290 | 15 |
## `geom_smooth()` using method = 'gam'
| ExterQual | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Ex | Excellent | 52 | 55 | 3.56 | 3.77 |
| Fa | Fair | 14 | 21 | 0.96 | 1.44 |
| Gd | Good | 488 | 491 | 33.42 | 33.65 |
| TA | Average/Typical | 906 | 892 | 62.05 | 61.14 |
| Total | Total | 1460 | 1459 | 99.99 | 100.00 |
| ExterCond | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Ex | Excellent | 3 | 9 | 0.21 | 0.62 |
| Fa | Fair | 28 | 39 | 1.92 | 2.67 |
| Gd | Good | 146 | 153 | 10.00 | 10.49 |
| Po | Poor | 1 | 2 | 0.07 | 0.14 |
| TA | Average/Typical | 1282 | 1256 | 87.81 | 86.09 |
| Total | Total | 1460 | 1459 | 100.01 | 100.01 |
| Foundation | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| BrkTil | Brick & Tile | 146 | 165 | 10.00 | 11.31 |
| CBlock | Cinder Block | 634 | 601 | 43.42 | 41.19 |
| PConc | Poured Contrete | 647 | 661 | 44.32 | 45.31 |
| Slab | Slab | 24 | 25 | 1.64 | 1.71 |
| Stone | Stone | 6 | 5 | 0.41 | 0.34 |
| Wood | Wood | 3 | 2 | 0.21 | 0.14 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| BsmtQual | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Basement | 37 | 44 | 2.53 | 3.02 |
| Ex | Excellent (100+ inches) | 121 | 137 | 8.29 | 9.39 |
| Fa | Fair (70-79 inches) | 35 | 53 | 2.40 | 3.63 |
| Gd | Good (90-99 inches) | 618 | 591 | 42.33 | 40.51 |
| TA | Typical (80-89 inches) | 649 | 634 | 44.45 | 43.45 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| BsmtCond | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Basement | 37 | 45 | 2.53 | 3.08 |
| Fa | Fair - dampness or some cracking or settling | 45 | 59 | 3.08 | 4.04 |
| Gd | Good | 65 | 57 | 4.45 | 3.91 |
| Po | Poor - Severe cracking | 2 | 3 | 0.14 | 0.21 |
| TA | Typical - slight dampness allowed | 1311 | 1295 | 89.79 | 88.76 |
| Total | Total | 1460 | 1459 | 99.99 | 100.00 |
| BsmtExposure | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Basement | 38 | 44 | 2.60 | 3.02 |
| Av | Average Exposure (split levels or foyers typically score average or above) | 221 | 197 | 15.14 | 13.50 |
| Gd | Good Exposure | 134 | 142 | 9.18 | 9.73 |
| Mn | Mimimum Exposure | 114 | 125 | 7.81 | 8.57 |
| No | No Exposure | 953 | 951 | 65.27 | 65.18 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| BsmtFinType1 | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Basement | 37 | 42 | 2.53 | 2.88 |
| ALQ | Average Living Quarters | 220 | 209 | 15.07 | 14.32 |
| BLQ | Below Average Living Quarters | 148 | 121 | 10.14 | 8.29 |
| GLQ | Good Living Quarters | 418 | 431 | 28.63 | 29.54 |
| LwQ | Low Quality | 74 | 80 | 5.07 | 5.48 |
| Rec | Average Rec Room | 133 | 155 | 9.11 | 10.62 |
| Unf | Unfinshed | 430 | 421 | 29.45 | 28.86 |
| Total | Total | 1460 | 1459 | 100.00 | 99.99 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 383.5 | 712.25 | 5644 | 0 |
| test | 0 | 0 | 350.5 | 753.50 | 4010 | 1 |
## `geom_smooth()` using method = 'gam'
| BsmtFinType2 | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Basement | 38 | 42 | 2.60 | 2.88 |
| ALQ | Average Living Quarters | 19 | 33 | 1.30 | 2.26 |
| BLQ | Below Average Living Quarters | 33 | 35 | 2.26 | 2.40 |
| GLQ | Good Living Quarters | 14 | 20 | 0.96 | 1.37 |
| LwQ | Low Quality | 46 | 41 | 3.15 | 2.81 |
| Rec | Average Rec Room | 54 | 51 | 3.70 | 3.50 |
| Unf | Unfinshed | 1256 | 1237 | 86.03 | 84.78 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 0 | 1474 | 0 |
| test | 0 | 0 | 0 | 0 | 1526 | 1 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 223.00 | 477.5 | 808.00 | 2336 | 0 |
| test | 0 | 219.25 | 460.0 | 797.75 | 2140 | 1 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 795.75 | 991.5 | 1298.25 | 6110 | 0 |
| test | 0 | 784.00 | 988.0 | 1305.00 | 5095 | 1 |
## `geom_smooth()` using method = 'gam'
| Heating | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Floor | Floor Furnace | 1 | 0 | 0.07 | 0.00 |
| GasA | Gas forced warm air furnace | 1428 | 1446 | 97.81 | 99.11 |
| GasW | Gas hot water or steam heat | 18 | 9 | 1.23 | 0.62 |
| Grav | Gravity furnace | 7 | 2 | 0.48 | 0.14 |
| OthW | Hot water or steam heat other than gas | 2 | 0 | 0.14 | 0.00 |
| Wall | Wall furnace | 4 | 2 | 0.27 | 0.14 |
| Total | Total | 1460 | 1459 | 100.00 | 100.01 |
| HeatingQC | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Ex | Excellent | 741 | 752 | 50.75 | 51.54 |
| Fa | Fair | 49 | 43 | 3.36 | 2.95 |
| Gd | Good | 241 | 233 | 16.51 | 15.97 |
| Po | Poor | 1 | 2 | 0.07 | 0.14 |
| TA | Average/Typical | 428 | 429 | 29.32 | 29.40 |
| Total | Total | 1460 | 1459 | 100.01 | 100.00 |
| CentralAir | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| N | No | 95 | 101 | 6.51 | 6.92 |
| Y | Yes | 1365 | 1358 | 93.49 | 93.08 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| Electrical | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 1 | 0 | 0.07 | 0.00 |
| FuseA | Fuse Box over 60 AMP and all Romex wiring (Average) | 94 | 94 | 6.44 | 6.44 |
| FuseF | 60 AMP Fuse Box and mostly Romex wiring (Fair) | 27 | 23 | 1.85 | 1.58 |
| FuseP | 60 AMP Fuse Box and mostly knob & tube wiring (poor) | 3 | 5 | 0.21 | 0.34 |
| Mix | Mixed | 1 | 0 | 0.07 | 0.00 |
| SBrkr | Standard Circuit Breakers & Romex | 1334 | 1337 | 91.37 | 91.64 |
| Total | Total | 1460 | 1459 | 100.01 | 100.00 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 334 | 882.0 | 1087 | 1391.25 | 4692 | 0 |
| test | 407 | 873.5 | 1079 | 1382.50 | 5095 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 728 | 2065 | 0 |
| test | 0 | 0 | 0 | 676 | 1862 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 0 | 572 | 0 |
| test | 0 | 0 | 0 | 0 | 1064 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 334 | 1129.5 | 1464 | 1776.75 | 5642 | 0 |
| test | 407 | 1117.5 | 1432 | 1721.00 | 5095 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 1 | 3 | 0 |
| test | 0 | 0 | 0 | 1 | 3 | 2 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 0 | 2 | 0 |
| test | 0 | 0 | 0 | 0 | 2 | 2 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 1 | 2 | 2 | 3 | 0 |
| test | 0 | 1 | 2 | 2 | 4 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 1 | 2 | 0 |
| test | 0 | 0 | 0 | 1 | 2 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 2 | 3 | 3 | 8 | 0 |
| test | 0 | 2 | 3 | 3 | 6 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 1 | 1 | 1 | 3 | 0 |
| test | 0 | 1 | 1 | 1 | 2 | 0 |
## `geom_smooth()` using method = 'gam'
| KitchenQual | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 0 | 1 | 0.00 | 0.07 |
| Ex | Excellent | 100 | 105 | 6.85 | 7.20 |
| Fa | Fair | 39 | 31 | 2.67 | 2.12 |
| Gd | Good | 586 | 565 | 40.14 | 38.73 |
| TA | Average/Typical | 735 | 757 | 50.34 | 51.88 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 2 | 5 | 6 | 7 | 14 | 0 |
| test | 3 | 5 | 6 | 7 | 15 | 0 |
## `geom_smooth()` using method = 'gam'
| Functional | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 0 | 2 | 0.00 | 0.14 |
| Maj1 | Major Deductions 1 | 14 | 5 | 0.96 | 0.34 |
| Maj2 | Major Deductions 2 | 5 | 4 | 0.34 | 0.27 |
| Min1 | Minor Deductions 1 | 31 | 34 | 2.12 | 2.33 |
| Min2 | Minor Deductions 2 | 34 | 36 | 2.33 | 2.47 |
| Mod | Moderate Deductions | 15 | 20 | 1.03 | 1.37 |
| Sev | Severely Damaged | 1 | 1 | 0.07 | 0.07 |
| Typ | Typical Functionality | 1360 | 1357 | 93.15 | 93.01 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 1 | 1 | 3 | 0 |
| test | 0 | 0 | 0 | 1 | 4 | 0 |
## `geom_smooth()` using method = 'gam'
| FireplaceQu | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Fireplace | 690 | 730 | 47.26 | 50.03 |
| Ex | Excellent - Exceptional Masonry Fireplace | 24 | 19 | 1.64 | 1.30 |
| Fa | Fair - Prefabricated Fireplace in basement | 33 | 41 | 2.26 | 2.81 |
| Gd | Good - Masonry Fireplace in main level | 380 | 364 | 26.03 | 24.95 |
| Po | Poor - Ben Franklin Stove | 20 | 26 | 1.37 | 1.78 |
| TA | Average - Prefabricated Fireplace in main living area or Masonry Fireplace in basement | 313 | 279 | 21.44 | 19.12 |
| Total | Total | 1460 | 1459 | 100.00 | 99.99 |
| GarageType | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Garage | 81 | 76 | 5.55 | 5.21 |
| 2Types | More than one type of garage | 6 | 17 | 0.41 | 1.17 |
| Attchd | Attached to home | 870 | 853 | 59.59 | 58.46 |
| Basment | Basement Garage | 19 | 17 | 1.30 | 1.17 |
| BuiltIn | Built-In (Garage part of house - typically has room above garage) | 88 | 98 | 6.03 | 6.72 |
| CarPort | Car Port | 9 | 6 | 0.62 | 0.41 |
| Detchd | Detached from home | 387 | 392 | 26.51 | 26.87 |
| Total | Total | 1460 | 1459 | 100.01 | 100.01 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 1900 | 1961 | 1980 | 2002 | 2010 | 81 |
| test | 1895 | 1959 | 1979 | 2002 | 2207 | 78 |
## `geom_smooth()` using method = 'gam'
| GarageFinish | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Garage | 81 | 78 | 5.55 | 5.35 |
| Fin | Finished | 352 | 367 | 24.11 | 25.15 |
| RFn | Rough Finished | 422 | 389 | 28.90 | 26.66 |
| Unf | Unfinished | 605 | 625 | 41.44 | 42.84 |
| Total | Total | 1460 | 1459 | 100.00 | 100.00 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 1 | 2 | 2 | 4 | 0 |
| test | 0 | 1 | 2 | 2 | 5 | 1 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 334.5 | 480 | 576 | 1418 | 0 |
| test | 0 | 318.0 | 480 | 576 | 1488 | 1 |
## `geom_smooth()` using method = 'gam'
| GarageQual | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Garage | 81 | 78 | 5.55 | 5.35 |
| Ex | Excellent | 3 | 0 | 0.21 | 0.00 |
| Fa | Fair | 48 | 76 | 3.29 | 5.21 |
| Gd | Good | 14 | 10 | 0.96 | 0.69 |
| Po | Poor | 3 | 2 | 0.21 | 0.14 |
| TA | Average/Typical | 1311 | 1293 | 89.79 | 88.62 |
| Total | Total | 1460 | 1459 | 100.01 | 100.01 |
| GarageCond | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Garage | 81 | 78 | 5.55 | 5.35 |
| Ex | Excellent | 2 | 1 | 0.14 | 0.07 |
| Fa | Fair | 35 | 39 | 2.40 | 2.67 |
| Gd | Good | 9 | 6 | 0.62 | 0.41 |
| Po | Poor | 7 | 7 | 0.48 | 0.48 |
| TA | Average/Typical | 1326 | 1328 | 90.82 | 91.02 |
| Total | Total | 1460 | 1459 | 100.01 | 100.00 |
| PavedDrive | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| N | Dirt/Gravel | 90 | 126 | 6.16 | 8.64 |
| P | Partial Pavement | 30 | 32 | 2.05 | 2.19 |
| Y | Paved | 1340 | 1301 | 91.78 | 89.17 |
| Total | Total | 1460 | 1459 | 99.99 | 100.00 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 168 | 857 | 0 |
| test | 0 | 0 | 0 | 168 | 1424 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 25 | 68 | 547 | 0 |
| test | 0 | 0 | 28 | 72 | 742 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 0 | 552 | 0 |
| test | 0 | 0 | 0 | 0 | 1012 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 0 | 508 | 0 |
| test | 0 | 0 | 0 | 0 | 360 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 0 | 480 | 0 |
| test | 0 | 0 | 0 | 0 | 576 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 0 | 738 | 0 |
| test | 0 | 0 | 0 | 0 | 800 | 0 |
## `geom_smooth()` using method = 'gam'
| PoolQC | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Pool | 1453 | 1456 | 99.52 | 99.79 |
| Ex | Excellent | 2 | 2 | 0.14 | 0.14 |
| Fa | Fair | 2 | 0 | 0.14 | 0.00 |
| Gd | Good | 3 | 1 | 0.21 | 0.07 |
| Total | Total | 1460 | 1459 | 100.01 | 100.00 |
| Fence | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | No Fence | 1179 | 1169 | 80.75 | 80.12 |
| GdPrv | Good Privacy | 59 | 59 | 4.04 | 4.04 |
| GdWo | Good Wood | 54 | 58 | 3.70 | 3.98 |
| MnPrv | Minimum Privacy | 157 | 172 | 10.75 | 11.79 |
| MnWw | Minimum Wood/Wire | 11 | 1 | 0.75 | 0.07 |
| Total | Total | 1460 | 1459 | 99.99 | 100.00 |
| MiscFeature | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | None | 1406 | 1408 | 96.30 | 96.50 |
| Gar2 | 2nd Garage (if not described in garage section) | 2 | 3 | 0.14 | 0.21 |
| Othr | Other | 2 | 2 | 0.14 | 0.14 |
| Shed | Shed (over 100 SF) | 49 | 46 | 3.36 | 3.15 |
| TenC | Tennis Court | 1 | 0 | 0.07 | 0.00 |
| Total | Total | 1460 | 1459 | 100.01 | 100.00 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 0 | 0 | 0 | 0 | 15500 | 0 |
| test | 0 | 0 | 0 | 0 | 17000 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 1 | 5 | 6 | 8 | 12 | 0 |
| test | 1 | 4 | 6 | 8 | 12 | 0 |
## `geom_smooth()` using method = 'gam'
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 2006 | 2007 | 2008 | 2009 | 2010 | 0 |
| test | 2006 | 2007 | 2008 | 2009 | 2010 | 0 |
## `geom_smooth()` using method = 'gam'
| SaleType | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| NA | NA | 0 | 1 | 0.00 | 0.07 |
| COD | Court Officer Deed/Estate | 43 | 44 | 2.95 | 3.02 |
| CWD | Warranty Deed - Cash | 4 | 8 | 0.27 | 0.55 |
| Con | Contract 15% Down payment regular terms | 2 | 3 | 0.14 | 0.21 |
| ConLD | Contract Low Down | 9 | 17 | 0.62 | 1.17 |
| ConLI | Contract Low Interest | 5 | 4 | 0.34 | 0.27 |
| ConLw | Contract Low Down payment and low interest | 5 | 3 | 0.34 | 0.21 |
| New | Home just constructed and sold | 122 | 117 | 8.36 | 8.02 |
| Oth | Other | 3 | 4 | 0.21 | 0.27 |
| WD | Warranty Deed - Conventional | 1267 | 1258 | 86.78 | 86.22 |
| Total | Total | 1460 | 1459 | 100.01 | 100.01 |
| SaleCondition | Libelle | Eff. train | Eff. test | % train | % test |
|---|---|---|---|---|---|
| Abnorml | Abnormal Sale - trade | 101 | 89 | 6.92 | 6.10 |
| AdjLand | Adjoining Land Purchase | 4 | 8 | 0.27 | 0.55 |
| Alloca | Allocation - two linked properties with separate deeds | 12 | 12 | 0.82 | 0.82 |
| Family | Sale between family members | 20 | 26 | 1.37 | 1.78 |
| Normal | Normal Sale | 1198 | 1204 | 82.05 | 82.52 |
| Partial | Home was not completed when last assessed (associated with New Homes) | 125 | 120 | 8.56 | 8.22 |
| Total | Total | 1460 | 1459 | 99.99 | 99.99 |
| set | Min. | 1st Qu. | Median | 3rd Qu. | Max. | NA’s |
|---|---|---|---|---|---|---|
| train | 34900 | 129975 | 163000 | 214000 | 755000 | 0 |
| test | Inf | NA | NA | NA | -Inf | 1459 |
## `geom_smooth()` using method = 'gam'